Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 260503 |
| Missing cells | 1442 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 188.1 MiB |
| Average record size in memory | 757.3 B |
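As a quick cross-check of the table above, the missing-cell percentage can be recomputed from the total cell count (variables × observations). This is only a sketch: `fmt_percent` is a hypothetical helper that imitates how this report appears to display very small and very large shares, and it is not taken from ydata-profiling itself.

```python
# Figures copied from the "Dataset statistics" table.
N_VARIABLES = 19
N_OBSERVATIONS = 260_503
MISSING_CELLS = 1_442


def fmt_percent(share: float) -> str:
    """Hypothetical formatter imitating the report's display convention."""
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"   # non-zero share that would otherwise print as 0.0%
    if share < 1 and round(share, 3) == 1:
        return "> 99.9%"  # share just below 100%
    return f"{share * 100:.1f}%"


total_cells = N_VARIABLES * N_OBSERVATIONS
missing_share = MISSING_CELLS / total_cells

print(total_cells)                 # number of cells in the dataset
print(fmt_percent(missing_share))  # how the share would be displayed
```

The per-variable missing counts listed later in the report (8 + 32 + 1390 + 4 + 4 + 4) add up to the same 1442 missing cells.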
Variable types
| Numeric | 9 |
|---|---|
| DateTime | 1 |
| Text | 4 |
| Categorical | 5 |
ARREST_BORO is highly overall correlated with ARREST_PRECINCT and 2 other fields | High correlation |
ARREST_PRECINCT is highly overall correlated with ARREST_BORO | High correlation |
KY_CD is highly overall correlated with LAW_CAT_CD | High correlation |
LAW_CAT_CD is highly overall correlated with KY_CD | High correlation |
Latitude is highly overall correlated with Y_COORD_CD | High correlation |
Longitude is highly overall correlated with X_COORD_CD | High correlation |
X_COORD_CD is highly overall correlated with ARREST_BORO and 1 other field | High correlation |
Y_COORD_CD is highly overall correlated with ARREST_BORO and 1 other field | High correlation |
LAW_CAT_CD is highly imbalanced (58.1%) | Imbalance |
Latitude is highly skewed (γ1 = -139.370098) | Skewed |
Longitude is highly skewed (γ1 = 154.9083432) | Skewed |
ARREST_KEY has unique values | Unique |
JURISDICTION_CODE has 224017 (86.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-02-14 03:19:36.314658 |
|---|---|
| Analysis finished | 2025-02-14 03:20:11.705014 |
| Duration | 35.39 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
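The reported duration follows directly from the two timestamps. A minimal sketch using only the standard library, with the timestamps copied from the table above:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2025-02-14 03:19:36.314658")
finished = datetime.fromisoformat("2025-02-14 03:20:11.705014")

# Elapsed wall-clock time of the analysis, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```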
Variables
ARREST_KEY
Real number (ℝ)
Unique 
| Distinct | 260503 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8939828 × 10⁸ |
| Minimum | 2.7976351 × 10⁸ |
|---|---|
| Maximum | 2.9874848 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 2.7976351 × 10⁸ |
|---|---|
| 5-th percentile | 2.8080923 × 10⁸ |
| Q1 | 2.8472986 × 10⁸ |
| median | 2.8945965 × 10⁸ |
| Q3 | 2.9410416 × 10⁸ |
| 95-th percentile | 2.9778914 × 10⁸ |
| Maximum | 2.9874848 × 10⁸ |
| Range | 18984975 |
| Interquartile range (IQR) | 9374306 |
Descriptive statistics
| Standard deviation | 5439968.3 |
|---|---|
| Coefficient of variation (CV) | 0.018797514 |
| Kurtosis | -1.1865378 |
| Mean | 2.8939828 × 10⁸ |
| Median Absolute Deviation (MAD) | 4683767 |
| Skewness | -0.028753483 |
| Sum | 7.5389121 × 10¹³ |
| Variance | 2.9593255 × 10¹³ |
| Monotonicity | Not monotonic |
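Several of the ARREST_KEY figures above are linked by simple identities (range = maximum − minimum, CV = standard deviation / mean, variance = standard deviation²). The sketch below recomputes them from the rounded values shown in the report, so small differences in the last digits are expected.

```python
# Values copied from the ARREST_KEY tables (extremes from the value listings).
minimum, maximum = 279_763_507, 298_748_482
mean = 2.8939828e8
std = 5_439_968.3

value_range = maximum - minimum  # Range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance

print(value_range)
print(f"{cv:.9f}")
print(f"{variance:.7e}")
```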
| Value | Count | Frequency (%) |
| 281369711 | 1 | < 0.1% |
| 298661797 | 1 | < 0.1% |
| 290501031 | 1 | < 0.1% |
| 290234642 | 1 | < 0.1% |
| 290099583 | 1 | < 0.1% |
| 290650670 | 1 | < 0.1% |
| 289740038 | 1 | < 0.1% |
| 297652616 | 1 | < 0.1% |
| 289459427 | 1 | < 0.1% |
| 297069918 | 1 | < 0.1% |
| Other values (260493) | 260493 | > 99.9% |
| Value | Count | Frequency (%) |
| 279763507 | 1 | < 0.1% |
| 279763792 | 1 | < 0.1% |
| 279763800 | 1 | < 0.1% |
| 279764505 | 1 | < 0.1% |
| 279764507 | 1 | < 0.1% |
| 279764510 | 1 | < 0.1% |
| 279764511 | 1 | < 0.1% |
| 279764512 | 1 | < 0.1% |
| 279764513 | 1 | < 0.1% |
| 279764515 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 298748482 | 1 | < 0.1% |
| 298725483 | 1 | < 0.1% |
| 298711176 | 1 | < 0.1% |
| 298711173 | 1 | < 0.1% |
| 298711171 | 1 | < 0.1% |
| 298711170 | 1 | < 0.1% |
| 298710745 | 1 | < 0.1% |
| 298710741 | 1 | < 0.1% |
| 298710736 | 1 | < 0.1% |
| 298710721 | 1 | < 0.1% |
ARREST_DATE
Date
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| Minimum | 2024-01-01 00:00:00 |
|---|---|
| Maximum | 2024-12-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
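The distinct count of 366 equals the number of calendar days between the minimum and maximum dates (2024 is a leap year), which suggests that every day of the year has at least one arrest record. A small sketch of that check, assuming all dates fall on whole days as the minimum and maximum do:

```python
from datetime import date

# Values copied from the ARREST_DATE tables.
first_day = date(2024, 1, 1)
last_day = date(2024, 12, 31)
distinct_dates_reported = 366

# Inclusive number of calendar days covered by the date range.
calendar_days = (last_day - first_day).days + 1
every_day_present = distinct_dates_reported == calendar_days

print(calendar_days, every_day_present)
```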
PD_CD
Real number (ℝ)
| Distinct | 267 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 8 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 431.64201 |
| Minimum | 2 |
|---|---|
| Maximum | 997 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 101 |
| Q1 | 117 |
| median | 397 |
| Q3 | 705 |
| 95-th percentile | 922 |
| Maximum | 997 |
| Range | 995 |
| Interquartile range (IQR) | 588 |
Descriptive statistics
| Standard deviation | 271.55787 |
|---|---|
| Coefficient of variation (CV) | 0.62912754 |
| Kurtosis | -1.1382518 |
| Mean | 431.64201 |
| Median Absolute Deviation (MAD) | 283 |
| Skewness | 0.3297609 |
| Sum | 1.1244058 × 10⁸ |
| Variance | 73743.679 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 101 | 28202 | 10.8% |
| 339 | 27107 | 10.4% |
| 109 | 15612 | 6.0% |
| 922 | 13254 | 5.1% |
| 478 | 12265 | 4.7% |
| 397 | 11966 | 4.6% |
| 779 | 9983 | 3.8% |
| 439 | 9739 | 3.7% |
| 511 | 7755 | 3.0% |
| 113 | 6499 | 2.5% |
| Other values (257) | 118113 | 45.3% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 15 | 43 | < 0.1% |
| 16 | 169 | 0.1% |
| 29 | 2 | < 0.1% |
| 30 | 1 | < 0.1% |
| 35 | 7 | < 0.1% |
| 49 | 1165 | 0.4% |
| 100 | 1 | < 0.1% |
| 101 | 28202 | 10.8% |
| Value | Count | Frequency (%) |
| 997 | 2 | < 0.1% |
| 973 | 1 | < 0.1% |
| 972 | 7 | < 0.1% |
| 969 | 2282 | 0.9% |
| 968 | 62 | < 0.1% |
| 965 | 1 | < 0.1% |
| 963 | 2 | < 0.1% |
| 947 | 1 | < 0.1% |
| 940 | 67 | < 0.1% |
| 939 | 2 | < 0.1% |
PD_DESC
Text
| Distinct | 257 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.4 MiB |
Length
| Max length | 54 |
|---|---|
| Median length | 52 |
| Mean length | 25.094571 |
| Min length | 6 |
Unique
| Unique | 23 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SEXUAL ABUSE |
|---|---|
| 2nd row | STRANGULATION 1ST |
| 3rd row | STRANGULATION 1ST |
| 4th row | STRANGULATION 1ST |
| 5th row | JOSTLING |
| Value | Count | Frequency (%) |
| assault | 46929 | 6.5% |
| 3 | 42183 | 5.9% |
| from | 38588 | 5.4% |
| open | 36846 | 5.1% |
| areas | 36846 | 5.1% |
| larceny,petit | 27107 | 3.8% |
| criminal | 18791 | 2.6% |
| controlled | 17987 | 2.5% |
| 2,1,unclassified | 15612 | 2.2% |
| traffic,unclassified | 15536 | 2.2% |
| Other values (383) | 422955 | 58.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 618697 | 9.5% |
| E | 586679 | 9.0% |
| A | 568469 | 8.7% |
| I | 476387 | 7.3% |
| N | 468740 | 7.2% |
| (space) | 464544 | 7.1% |
| R | 366535 | 5.6% |
| T | 341412 | 5.2% |
| L | 328059 | 5.0% |
| C | 313321 | 4.8% |
| Other values (32) | 2004368 | 30.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6537211 | 100.0% |
KY_CD
Real number (ℝ)
High correlation 
| Distinct | 70 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 32 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 252.54496 |
| Minimum | 101 |
|---|---|
| Maximum | 995 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 105 |
| Q1 | 114 |
| median | 341 |
| Q3 | 344 |
| 95-th percentile | 359 |
| Maximum | 995 |
| Range | 894 |
| Interquartile range (IQR) | 230 |
Descriptive statistics
| Standard deviation | 144.94242 |
|---|---|
| Coefficient of variation (CV) | 0.5739272 |
| Kurtosis | 5.1811136 |
| Mean | 252.54496 |
| Median Absolute Deviation (MAD) | 106 |
| Skewness | 1.49722 |
| Sum | 65780637 |
| Variance | 21008.305 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 344 | 38238 | 14.7% |
| 341 | 27107 | 10.4% |
| 106 | 22606 | 8.7% |
| 126 | 16360 | 6.3% |
| 348 | 13783 | 5.3% |
| 343 | 12621 | 4.8% |
| 105 | 12020 | 4.6% |
| 109 | 11804 | 4.5% |
| 117 | 10424 | 4.0% |
| 359 | 8712 | 3.3% |
| Other values (60) | 86796 | 33.3% |
| Value | Count | Frequency (%) |
| 101 | 1601 | 0.6% |
| 102 | 8 | < 0.1% |
| 103 | 60 | < 0.1% |
| 104 | 792 | 0.3% |
| 105 | 12020 | 4.6% |
| 106 | 22606 | 8.7% |
| 107 | 6450 | 2.5% |
| 109 | 11804 | 4.5% |
| 110 | 2086 | 0.8% |
| 111 | 2427 | 0.9% |
| Value | Count | Frequency (%) |
| 995 | 1390 | 0.5% |
| 882 | 2 | < 0.1% |
| 881 | 2352 | 0.9% |
| 880 | 78 | < 0.1% |
| 685 | 2 | < 0.1% |
| 678 | 181 | 0.1% |
| 677 | 2192 | 0.8% |
| 676 | 1 | < 0.1% |
| 675 | 139 | 0.1% |
| 672 | 2 | < 0.1% |
OFNS_DESC
Text
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.1 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 20.019036 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SEX CRIMES |
|---|---|
| 2nd row | FELONY ASSAULT |
| 3rd row | FELONY ASSAULT |
| 4th row | FELONY ASSAULT |
| 5th row | JOSTLING |
| Value | Count | Frequency (%) |
| offenses | 67295 | 8.3% |
| related | 63764 | 7.9% |
| assault | 60844 | 7.5% |
| | 59138 | 7.3% |
| larceny | 40997 | 5.1% |
| 3 | 38250 | 4.7% |
| dangerous | 29299 | 3.6% |
| petit | 27107 | 3.4% |
| felony | 22606 | 2.8% |
| other | 20756 | 2.6% |
| Other values (102) | 376376 | 46.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 593170 | 11.4% |
| (space) | 545929 | 10.5% |
| A | 466154 | 8.9% |
| S | 456542 | 8.8% |
| L | 339538 | 6.5% |
| N | 316131 | 6.1% |
| R | 314901 | 6.0% |
| T | 313796 | 6.0% |
| O | 266257 | 5.1% |
| F | 256252 | 4.9% |
| Other values (30) | 1346349 | 25.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5215019 | 100.0% |
LAW_CODE
Text
| Distinct | 1151 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9998388 |
| Min length | 6 |
Unique
| Unique | 268 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | PL 1306501 |
|---|---|
| 2nd row | PL 1211200 |
| 3rd row | PL 1211200 |
| 4th row | PL 1211200 |
| 5th row | PL 1652501 |
| Value | Count | Frequency (%) |
| pl | 235298 | 47.4% |
| 1200001 | 27625 | 5.6% |
| 1552500 | 27107 | 5.5% |
| 1651503 | 11915 | 2.4% |
| vtl0511001 | 8786 | 1.8% |
| 215510b | 8462 | 1.7% |
| 2200300 | 7755 | 1.6% |
| 1200502 | 7684 | 1.5% |
| 1553001 | 6231 | 1.3% |
| 1201401 | 5372 | 1.1% |
| Other values (1146) | 150255 | 30.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 667104 | 25.6% |
| 1 | 410966 | 15.8% |
| 5 | 276271 | 10.6% |
| L | 257787 | 9.9% |
| P | 236061 | 9.1% |
| (space) | 235987 | 9.1% |
| 2 | 232882 | 8.9% |
| 3 | 61257 | 2.4% |
| 6 | 59723 | 2.3% |
| 4 | 43994 | 1.7% |
| Other values (31) | 122956 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2604988 | 100.0% |
LAW_CAT_CD
Categorical
High correlation  Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1390 |
| Missing (%) | 0.5% |
| Memory size | 14.4 MiB |
| M | 146772 |
|---|---|
| F | 109140 |
| V | 2233 |
| 9 | 734 |
| I | 226 |
Length
| Max length | 6 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001544 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 146772 | 56.3% |
| F | 109140 | 41.9% |
| V | 2233 | 0.9% |
| 9 | 734 | 0.3% |
| I | 226 | 0.1% |
| (null) | 8 | < 0.1% |
| (Missing) | 1390 | 0.5% |
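The 58.1% imbalance reported for LAW_CAT_CD can be reproduced from the counts above if the score is taken to be one minus the normalized Shannon entropy of the class distribution. That definition is an assumption here (it happens to match the reported figure), and `imbalance_score` is a hypothetical helper, not a function of the profiling tool.

```python
import math

# Counts copied from the "Common Values" table; the (Missing) row is excluded.
counts = {"M": 146_772, "F": 109_140, "V": 2_233, "9": 734, "I": 226, "(null)": 8}


def imbalance_score(class_counts):
    """Return 1 - H / log(k): 0 for a uniform split, close to 1 when one class dominates."""
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # not meaningful for a single class; 0 is a convention chosen here
    entropy = -sum((c / total) * math.log(c / total) for c in class_counts if c > 0)
    return 1.0 - entropy / math.log(k)


score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

Two classes (M and F) hold almost all observations out of six distinct values, which is why the entropy is far below its maximum and the score is high.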
Words
| Value | Count | Frequency (%) |
| m | 146772 | 56.6% |
| f | 109140 | 42.1% |
| v | 2233 | 0.9% |
| 9 | 734 | 0.3% |
| i | 226 | 0.1% |
| null | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 146772 | 56.6% |
| F | 109140 | 42.1% |
| V | 2233 | 0.9% |
| 9 | 734 | 0.3% |
| I | 226 | 0.1% |
| l | 16 | < 0.1% |
| ( | 8 | < 0.1% |
| n | 8 | < 0.1% |
| u | 8 | < 0.1% |
| ) | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 259153 | 100.0% |
ARREST_BORO
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.4 MiB |
| K | 72325 |
|---|---|
| M | 61969 |
| B | 58521 |
| Q | 56633 |
| S | 11055 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | B |
| 3rd row | M |
| 4th row | K |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| K | 72325 | 27.8% |
| M | 61969 | 23.8% |
| B | 58521 | 22.5% |
| Q | 56633 | 21.7% |
| S | 11055 | 4.2% |
Words
| Value | Count | Frequency (%) |
| k | 72325 | 27.8% |
| m | 61969 | 23.8% |
| b | 58521 | 22.5% |
| q | 56633 | 21.7% |
| s | 11055 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| K | 72325 | 27.8% |
| M | 61969 | 23.8% |
| B | 58521 | 22.5% |
| Q | 56633 | 21.7% |
| S | 11055 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 260503 | 100.0% |
ARREST_PRECINCT
Real number (ℝ)
High correlation 
| Distinct | 79 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63.410936 |
| Minimum | 1 |
|---|---|
| Maximum | 483 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 40 |
| median | 63 |
| Q3 | 101 |
| 95-th percentile | 115 |
| Maximum | 483 |
| Range | 482 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 34.955962 |
|---|---|
| Coefficient of variation (CV) | 0.55126078 |
| Kurtosis | -0.9341508 |
| Mean | 63.410936 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 0.067836522 |
| Sum | 16518739 |
| Variance | 1221.9193 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 9887 | 3.8% |
| 75 | 8675 | 3.3% |
| 40 | 8389 | 3.2% |
| 103 | 7983 | 3.1% |
| 44 | 7690 | 3.0% |
| 46 | 6605 | 2.5% |
| 110 | 6440 | 2.5% |
| 73 | 5727 | 2.2% |
| 120 | 5535 | 2.1% |
| 18 | 5487 | 2.1% |
| Other values (69) | 188085 | 72.2% |
| Value | Count | Frequency (%) |
| 1 | 3419 | 1.3% |
| 5 | 3640 | 1.4% |
| 6 | 2285 | 0.9% |
| 7 | 2351 | 0.9% |
| 9 | 1813 | 0.7% |
| 10 | 1961 | 0.8% |
| 13 | 3804 | 1.5% |
| 14 | 9887 | 3.8% |
| 17 | 1168 | 0.4% |
| 18 | 5487 | 2.1% |
| Value | Count | Frequency (%) |
| 483 | 3 | < 0.1% |
| 123 | 1052 | 0.4% |
| 122 | 1625 | 0.6% |
| 121 | 2843 | 1.1% |
| 120 | 5535 | 2.1% |
| 116 | 111 | < 0.1% |
| 115 | 5384 | 2.1% |
| 114 | 4173 | 1.6% |
| 113 | 5201 | 2.0% |
| 112 | 1891 | 0.7% |
JURISDICTION_CODE
Real number (ℝ)
Zeros 
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.90454237 |
| Minimum | 0 |
|---|---|
| Maximum | 97 |
| Zeros | 224017 |
| Zeros (%) | 86.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 97 |
| Range | 97 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 6.8823177 |
|---|---|
| Coefficient of variation (CV) | 7.6086185 |
| Kurtosis | 130.05739 |
| Mean | 0.90454237 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.102277 |
| Sum | 235636 |
| Variance | 47.366298 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 224017 | 86.0% |
| 1 | 19880 | 7.6% |
| 2 | 9920 | 3.8% |
| 3 | 2119 | 0.8% |
| 17 | 2012 | 0.8% |
| 72 | 561 | 0.2% |
| 97 | 473 | 0.2% |
| 73 | 397 | 0.2% |
| 11 | 263 | 0.1% |
| 51 | 202 | 0.1% |
| Other values (15) | 659 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 224017 | 86.0% |
| 1 | 19880 | 7.6% |
| 2 | 9920 | 3.8% |
| 3 | 2119 | 0.8% |
| 4 | 81 | < 0.1% |
| 7 | 122 | < 0.1% |
| 11 | 263 | 0.1% |
| 12 | 10 | < 0.1% |
| 13 | 10 | < 0.1% |
| 14 | 107 | < 0.1% |
| Value | Count | Frequency (%) |
| 97 | 473 | 0.2% |
| 88 | 17 | < 0.1% |
| 87 | 112 | < 0.1% |
| 85 | 5 | < 0.1% |
| 79 | 6 | < 0.1% |
| 76 | 1 | < 0.1% |
| 74 | 2 | < 0.1% |
| 73 | 397 | 0.2% |
| 72 | 561 | 0.2% |
| 71 | 106 | < 0.1% |
AGE_GROUP
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.4 MiB |
| 25-44 | 152034 |
|---|---|
| 45-64 | 51121 |
| 18-24 | 43174 |
| <18 | 9525 |
| 65+ | 4649 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.8911798 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 25-44 |
|---|---|
| 2nd row | 25-44 |
| 3rd row | 25-44 |
| 4th row | 25-44 |
| 5th row | 18-24 |
Common Values
| Value | Count | Frequency (%) |
| 25-44 | 152034 | 58.4% |
| 45-64 | 51121 | 19.6% |
| 18-24 | 43174 | 16.6% |
| <18 | 9525 | 3.7% |
| 65+ | 4649 | 1.8% |
Words
| Value | Count | Frequency (%) |
| 25-44 | 152034 | 58.4% |
| 45-64 | 51121 | 19.6% |
| 18-24 | 43174 | 16.6% |
| 18 | 9525 | 3.7% |
| 65 | 4649 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 449484 | 35.3% |
| - | 246329 | 19.3% |
| 5 | 207804 | 16.3% |
| 2 | 195208 | 15.3% |
| 6 | 55770 | 4.4% |
| 1 | 52699 | 4.1% |
| 8 | 52699 | 4.1% |
| < | 9525 | 0.7% |
| + | 4649 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1274167 | 100.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 213587 | 82.0% |
| F | 46916 | 18.0% |
Words
| Value | Count | Frequency (%) |
| m | 213587 | 82.0% |
| f | 46916 | 18.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 213587 | 82.0% |
| F | 46916 | 18.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 260503 | 100.0% |
PERP_RACE
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| BLACK | 122049 |
|---|---|
| WHITE HISPANIC | 69131 |
| BLACK HISPANIC | 26549 |
| WHITE | 26161 |
| ASIAN / PACIFIC ISLANDER | 14838 |
| Other values (2) | 1775 |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 9.4737642 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BLACK |
|---|---|
| 2nd row | BLACK |
| 3rd row | BLACK |
| 4th row | BLACK |
| 5th row | WHITE |
Common Values
| Value | Count | Frequency (%) |
| BLACK | 122049 | 46.9% |
| WHITE HISPANIC | 69131 | 26.5% |
| BLACK HISPANIC | 26549 | 10.2% |
| WHITE | 26161 | 10.0% |
| ASIAN / PACIFIC ISLANDER | 14838 | 5.7% |
| UNKNOWN | 956 | 0.4% |
| AMERICAN INDIAN/ALASKAN NATIVE | 819 | 0.3% |
Words
| Value | Count | Frequency (%) |
| black | 148598 | 36.9% |
| hispanic | 95680 | 23.8% |
| white | 95292 | 23.7% |
| asian | 14838 | 3.7% |
| / | 14838 | 3.7% |
| pacific | 14838 | 3.7% |
| islander | 14838 | 3.7% |
| unknown | 956 | 0.2% |
| american | 819 | 0.2% |
| indian/alaskan | 819 | 0.2% |
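The counts in the table above look like word frequencies rather than whole-category frequencies: they can be reproduced from the PERP_RACE category counts by lower-casing each value and splitting it on whitespace. That tokenization rule is inferred from the numbers and is an assumption, not a documented detail of the report.

```python
from collections import Counter

# Category counts copied from the PERP_RACE "Common Values" table.
category_counts = {
    "BLACK": 122_049,
    "WHITE HISPANIC": 69_131,
    "BLACK HISPANIC": 26_549,
    "WHITE": 26_161,
    "ASIAN / PACIFIC ISLANDER": 14_838,
    "UNKNOWN": 956,
    "AMERICAN INDIAN/ALASKAN NATIVE": 819,
}

# Assumed tokenization: lower-case, split on whitespace.
word_counts = Counter()
for value, count in category_counts.items():
    for token in value.lower().split():
        word_counts[token] += count

total_words = sum(word_counts.values())
for token, count in word_counts.most_common(4):
    print(f"{token:10s} {count:7d} {count / total_words:.1%}")
```

Under this assumption, "black" combines BLACK and BLACK HISPANIC, and the standalone "/" in ASIAN / PACIFIC ISLANDER is counted as its own token.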
Most occurring characters
| Value | Count | Frequency (%) |
| I | 349280 | 14.2% |
| A | 309363 | 12.5% |
| C | 274773 | 11.1% |
| H | 190972 | 7.7% |
| L | 164255 | 6.7% |
| K | 150373 | 6.1% |
| B | 148598 | 6.0% |
| (space) | 141832 | 5.7% |
| N | 132319 | 5.4% |
| S | 126175 | 5.1% |
| Other values (12) | 480004 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2467944 | 100.0% |
X_COORD_CD
Real number (ℝ)
High correlation 
| Distinct | 31272 |
|---|---|
| Distinct (%) | 12.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005551.9 |
| Minimum | 0 |
|---|---|
| Maximum | 1067220 |
| Zeros | 10 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 979434 |
| Q1 | 990796 |
| median | 1005257 |
| Q3 | 1017771 |
| 95-th percentile | 1041920 |
| Maximum | 1067220 |
| Range | 1067220 |
| Interquartile range (IQR) | 26975 |
Descriptive statistics
| Standard deviation | 22036.796 |
|---|---|
| Coefficient of variation (CV) | 0.021915125 |
| Kurtosis | 166.99329 |
| Mean | 1005551.9 |
| Median Absolute Deviation (MAD) | 13631 |
| Skewness | -3.8353061 |
| Sum | 2.6194929 × 10¹¹ |
| Variance | 4.8562036 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1017119 | 1692 | 0.6% |
| 987220 | 1334 | 0.5% |
| 1032084 | 1319 | 0.5% |
| 962808 | 1318 | 0.5% |
| 1005040 | 1298 | 0.5% |
| 1020232 | 1266 | 0.5% |
| 1041879 | 1235 | 0.5% |
| 1011750 | 1224 | 0.5% |
| 1026486 | 1219 | 0.5% |
| 997897 | 1189 | 0.5% |
| Other values (31262) | 247409 | 95.0% |
| Value | Count | Frequency (%) |
| 0 | 10 | < 0.1% |
| 913979 | 1 | < 0.1% |
| 914042 | 1 | < 0.1% |
| 914213 | 1 | < 0.1% |
| 914507 | 1 | < 0.1% |
| 914643 | 1 | < 0.1% |
| 914803 | 31 | < 0.1% |
| 914868 | 1 | < 0.1% |
| 914911 | 1 | < 0.1% |
| 914925 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1067220 | 1 | < 0.1% |
| 1067185 | 5 | < 0.1% |
| 1066815 | 1 | < 0.1% |
| 1066674 | 1 | < 0.1% |
| 1066636 | 8 | < 0.1% |
| 1066615 | 2 | < 0.1% |
| 1066560 | 2 | < 0.1% |
| 1066523 | 1 | < 0.1% |
| 1066431 | 1 | < 0.1% |
| 1066424 | 1 | < 0.1% |
Y_COORD_CD
Real number (ℝ)
High correlation 
| Distinct | 33189 |
|---|---|
| Distinct (%) | 12.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 207816.82 |
| Minimum | 0 |
|---|---|
| Maximum | 271282 |
| Zeros | 10 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 158969 |
| Q1 | 185644 |
| median | 206961 |
| Q3 | 235593 |
| 95-th percentile | 253814 |
| Maximum | 271282 |
| Range | 271282 |
| Interquartile range (IQR) | 49949 |
Descriptive statistics
| Standard deviation | 29500.727 |
|---|---|
| Coefficient of variation (CV) | 0.14195543 |
| Kurtosis | -0.77305126 |
| Mean | 207816.82 |
| Median Absolute Deviation (MAD) | 23850 |
| Skewness | -0.038671414 |
| Sum | 5.4136906 × 10¹⁰ |
| Variance | 8.7029288 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 183909 | 1689 | 0.6% |
| 212676 | 1334 | 0.5% |
| 216954 | 1319 | 0.5% |
| 174275 | 1318 | 0.5% |
| 234533 | 1306 | 0.5% |
| 210719 | 1266 | 0.5% |
| 197083 | 1235 | 0.5% |
| 250274 | 1224 | 0.5% |
| 262591 | 1219 | 0.5% |
| 175676 | 1176 | 0.5% |
| Other values (33179) | 247417 | 95.0% |
| Value | Count | Frequency (%) |
| 0 | 10 | < 0.1% |
| 121508 | 1 | < 0.1% |
| 121900 | 1 | < 0.1% |
| 121929 | 1 | < 0.1% |
| 122258 | 1 | < 0.1% |
| 122533 | 1 | < 0.1% |
| 123092 | 1 | < 0.1% |
| 123163 | 2 | < 0.1% |
| 123321 | 1 | < 0.1% |
| 123357 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 271282 | 2 | < 0.1% |
| 271127 | 1 | < 0.1% |
| 270906 | 5 | < 0.1% |
| 270801 | 5 | < 0.1% |
| 270744 | 1 | < 0.1% |
| 270713 | 3 | < 0.1% |
| 270689 | 1 | < 0.1% |
| 270345 | 1 | < 0.1% |
| 270310 | 1 | < 0.1% |
| 270191 | 1 | < 0.1% |
Latitude
Real number (ℝ)
High correlation  Skewed 
| Distinct | 42791 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.735495 |
| Minimum | 0 |
|---|---|
| Maximum | 40.911236 |
| Zeros | 10 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.60274 |
| Q1 | 40.67619 |
| median | 40.734681 |
| Q3 | 40.813303 |
| 95-th percentile | 40.863284 |
| Maximum | 40.911236 |
| Range | 40.911236 |
| Interquartile range (IQR) | 0.13711317 |
Descriptive statistics
| Standard deviation | 0.26504229 |
|---|---|
| Coefficient of variation (CV) | 0.0065064212 |
| Kurtosis | 21417.879 |
| Mean | 40.735495 |
| Median Absolute Deviation (MAD) | 0.065453734 |
| Skewness | -139.3701 |
| Sum | 10611556 |
| Variance | 0.070247415 |
| Monotonicity | Not monotonic |
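The extreme skewness and kurtosis of Latitude are consistent with the 10 zero-valued coordinates reported above, which sit about 40.7 degrees away from every other observation. The sketch below estimates the skewness contributed by those zeros alone, ignoring all other observations (a simplifying assumption), using the mean and standard deviation from this report. `skew_from_repeated_outlier` is a hypothetical helper written for this illustration.

```python
def skew_from_repeated_outlier(mean, std, n_valid, n_outliers, outlier_value=0.0):
    """Third standardized moment contributed by n_outliers copies of outlier_value.

    Ignores the contribution of all other observations (an assumption).
    """
    deviation = outlier_value - mean
    third_central_moment = (n_outliers / n_valid) * deviation ** 3
    return third_central_moment / std ** 3


n_valid = 260_503 - 4  # observations minus missing values

# Mean and standard deviation copied from the Latitude and Longitude sections.
latitude_skew = skew_from_repeated_outlier(40.735495, 0.26504229, n_valid, 10)
longitude_skew = skew_from_repeated_outlier(-73.920132, 0.46430435, n_valid, 10)

print(f"Latitude:  {latitude_skew:.1f}")
print(f"Longitude: {longitude_skew:.1f}")
```

Both estimates land close to the reported skewness values (-139.37 for Latitude and 154.91 for Longitude), which suggests the alerts are driven almost entirely by the zero placeholders rather than by the real coordinate distribution.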
| Value | Count | Frequency (%) |
| 40.671404 | 1584 | 0.6% |
| 40.762037 | 1266 | 0.5% |
| 40.810391 | 1260 | 0.5% |
| 40.644996 | 1252 | 0.5% |
| 40.750423 | 1243 | 0.5% |
| 40.744981 | 1204 | 0.5% |
| 40.853578 | 1179 | 0.5% |
| 40.707439 | 1165 | 0.4% |
| 40.887325 | 1164 | 0.4% |
| 40.648859 | 1103 | 0.4% |
| Other values (42781) | 248079 | 95.2% |
| Value | Count | Frequency (%) |
| 0 | 10 | < 0.1% |
| 40.49994 | 1 | < 0.1% |
| 40.501018 | 1 | < 0.1% |
| 40.501092 | 1 | < 0.1% |
| 40.501975 | 1 | < 0.1% |
| 40.502754 | 1 | < 0.1% |
| 40.504259 | 1 | < 0.1% |
| 40.50447061 | 2 | < 0.1% |
| 40.504915 | 1 | < 0.1% |
| 40.5050145 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.911236 | 2 | < 0.1% |
| 40.91081 | 1 | < 0.1% |
| 40.91020132 | 5 | < 0.1% |
| 40.909915 | 5 | < 0.1% |
| 40.909767 | 1 | < 0.1% |
| 40.90967503 | 3 | < 0.1% |
| 40.90960757 | 1 | < 0.1% |
| 40.908615 | 1 | < 0.1% |
| 40.908567 | 1 | < 0.1% |
| 40.908193 | 1 | < 0.1% |
Longitude
Real number (ℝ)
High correlation  Skewed 
| Distinct | 42739 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.920132 |
| Minimum | -74.252711 |
| Maximum | 0 |
| Zeros | 10 |
| Zeros (%) | < 0.1% |
| Negative | 260489 |
| Negative (%) | > 99.9% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | -74.252711 |
|---|---|
| 5-th percentile | -74.017299 |
| Q1 | -73.976436 |
| median | -73.92417 |
| Q3 | -73.879026 |
| 95-th percentile | -73.791995 |
| Maximum | 0 |
| Range | 74.252711 |
| Interquartile range (IQR) | 0.09741 |
Descriptive statistics
| Standard deviation | 0.46430435 |
|---|---|
| Coefficient of variation (CV) | -0.0062811623 |
| Kurtosis | 24659.983 |
| Mean | -73.920132 |
| Median Absolute Deviation (MAD) | 0.049186534 |
| Skewness | 154.90834 |
| Sum | -19256121 |
| Variance | 0.21557853 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| -73.881509 | 1584 | 0.6% |
| -73.827328 | 1266 | 0.5% |
| -73.924895 | 1260 | 0.5% |
| -74.077263 | 1252 | 0.5% |
| -73.98928 | 1244 | 0.5% |
| -73.870144 | 1204 | 0.5% |
| -73.900591 | 1179 | 0.5% |
| -73.792139 | 1165 | 0.4% |
| -73.847247 | 1164 | 0.4% |
| -73.95082 | 1103 | 0.4% |
| Other values (42729) | 248078 | 95.2% |
| Value | Count | Frequency (%) |
|---|---|---|
| -74.25271141 | 1 | < 0.1% |
| -74.252487 | 1 | < 0.1% |
| -74.251844 | 1 | < 0.1% |
| -74.25081 | 1 | < 0.1% |
| -74.250331 | 1 | < 0.1% |
| -74.24975495 | 31 | < 0.1% |
| -74.24952 | 1 | < 0.1% |
| -74.24935766 | 1 | < 0.1% |
| -74.249303 | 10 | < 0.1% |
| -74.249302 | 192 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 10 | < 0.1% |
| -73.70059685 | 1 | < 0.1% |
| -73.700717 | 4 | < 0.1% |
| -73.700719 | 1 | < 0.1% |
| -73.702045 | 1 | < 0.1% |
| -73.702535 | 1 | < 0.1% |
| -73.702646 | 8 | < 0.1% |
| -73.702756 | 2 | < 0.1% |
| -73.702966 | 2 | < 0.1% |
| -73.7030874 | 1 | < 0.1% |
New Georeferenced Column
Text
| Distinct | 44251 |
|---|---|
| Distinct (%) | 17.0% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 22.3 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 28 |
| Mean length | 32.947723 |
| Min length | 11 |
Unique
| Unique | 18895 |
|---|---|
| Unique (%) | 7.3% |
Sample
| 1st row | POINT (-73.9410982410066 40.8009303727402) |
|---|---|
| 2nd row | POINT (-73.927554 40.833209) |
| 3rd row | POINT (-73.952863 40.778348) |
| 4th row | POINT (-73.905128 40.648698) |
| 5th row | POINT (-73.973717 40.763313) |
| Value | Count | Frequency (%) |
|---|---|---|
| point | 260499 | 33.3% |
| 40.671404 | 1584 | 0.2% |
| 73.881509 | 1584 | 0.2% |
| 73.827328 | 1266 | 0.2% |
| 40.762037 | 1266 | 0.2% |
| 73.924895 | 1260 | 0.2% |
| 40.810391 | 1260 | 0.2% |
| 74.077263 | 1252 | 0.2% |
| 40.644996 | 1252 | 0.2% |
| 73.98928 | 1244 | 0.2% |
| Other values (85520) | 509030 | 65.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 759713 | 8.9% |
| 4 | 696425 | 8.1% |
| 0 | 631201 | 7.4% |
| 3 | 627318 | 7.3% |
| 9 | 543342 | 6.3% |
| 8 | 540252 | 6.3% |
| (space) | 520998 | 6.1% |
| . | 520978 | 6.1% |
| 6 | 465600 | 5.4% |
| 5 | 418798 | 4.9% |
| Other values (10) | 2858224 | 33.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 8582849 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 759713 | 8.9% |
| 4 | 696425 | 8.1% |
| 0 | 631201 | 7.4% |
| 3 | 627318 | 7.3% |
| 9 | 543342 | 6.3% |
| 8 | 540252 | 6.3% |
| (space) | 520998 | 6.1% |
| . | 520978 | 6.1% |
| 6 | 465600 | 5.4% |
| 5 | 418798 | 4.9% |
| Other values (10) | 2858224 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 8582849 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 759713 | 8.9% |
| 4 | 696425 | 8.1% |
| 0 | 631201 | 7.4% |
| 3 | 627318 | 7.3% |
| 9 | 543342 | 6.3% |
| 8 | 540252 | 6.3% |
| (space) | 520998 | 6.1% |
| . | 520978 | 6.1% |
| 6 | 465600 | 5.4% |
| 5 | 418798 | 4.9% |
| Other values (10) | 2858224 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 8582849 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 759713 | 8.9% |
| 4 | 696425 | 8.1% |
| 0 | 631201 | 7.4% |
| 3 | 627318 | 7.3% |
| 9 | 543342 | 6.3% |
| 8 | 540252 | 6.3% |
| (space) | 520998 | 6.1% |
| . | 520978 | 6.1% |
| 6 | 465600 | 5.4% |
| 5 | 418798 | 4.9% |
| Other values (10) | 2858224 | 33.3% |
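The sample rows for New Georeferenced Column are strings of the form `POINT (longitude latitude)`, with longitude first. A minimal sketch of turning such a string into a numeric pair follows; the function name `parse_point` and the regular expression are illustrative assumptions, not part of the profiled dataset or of ydata-profiling. The minimum length of 11 is consistent with a `POINT (0 0)` placeholder, though that is an inference from the statistics above.

```python
import re

# Matches "POINT (<lon> <lat>)" with optional sign and decimal part.
_POINT_RE = re.compile(r"^POINT \((-?\d+(?:\.\d+)?) (-?\d+(?:\.\d+)?)\)$")


def parse_point(text):
    """Return (longitude, latitude) as floats, or None if the text does not match."""
    match = _POINT_RE.match(text.strip())
    if match is None:
        return None
    lon, lat = (float(group) for group in match.groups())
    return lon, lat


# Second sample row from the report.
print(parse_point("POINT (-73.927554 40.833209)"))
# Inferred placeholder form (assumption).
print(parse_point("POINT (0 0)"))
```

Returning `None` for unmatched input keeps missing or malformed cells distinguishable from the zero placeholder.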
Interactions
Correlations
| | AGE_GROUP | ARREST_BORO | ARREST_KEY | ARREST_PRECINCT | JURISDICTION_CODE | KY_CD | LAW_CAT_CD | Latitude | Longitude | PD_CD | PERP_RACE | PERP_SEX | X_COORD_CD | Y_COORD_CD |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AGE_GROUP | 1.000 | 0.025 | 0.013 | 0.013 | 0.016 | 0.068 | 0.063 | 0.004 | 0.004 | 0.086 | 0.062 | 0.023 | 0.006 | 0.020 |
| ARREST_BORO | 0.025 | 1.000 | 0.010 | 0.788 | 0.051 | 0.045 | 0.049 | 0.003 | 0.003 | 0.078 | 0.165 | 0.022 | 0.538 | 0.552 |
| ARREST_KEY | 0.013 | 0.010 | 1.000 | -0.004 | 0.020 | -0.002 | 0.013 | 0.002 | 0.000 | -0.007 | 0.011 | 0.011 | 0.000 | 0.002 |
| ARREST_PRECINCT | 0.013 | 0.788 | -0.004 | 1.000 | -0.070 | 0.005 | 0.048 | -0.472 | 0.383 | 0.026 | 0.142 | 0.013 | 0.384 | -0.472 |
| JURISDICTION_CODE | 0.016 | 0.051 | 0.020 | -0.070 | 1.000 | 0.123 | 0.020 | 0.027 | -0.018 | 0.098 | 0.028 | 0.016 | -0.018 | 0.027 |
| KY_CD | 0.068 | 0.045 | -0.002 | 0.005 | 0.123 | 1.000 | 0.723 | -0.010 | -0.002 | 0.156 | 0.038 | 0.074 | -0.002 | -0.010 |
| LAW_CAT_CD | 0.063 | 0.049 | 0.013 | 0.048 | 0.020 | 0.723 | 1.000 | 0.034 | 0.034 | 0.393 | 0.030 | 0.051 | 0.029 | 0.027 |
| Latitude | 0.004 | 0.003 | 0.002 | -0.472 | 0.027 | -0.010 | 0.034 | 1.000 | 0.279 | -0.047 | 0.010 | 0.000 | 0.279 | 1.000 |
| Longitude | 0.004 | 0.003 | 0.000 | 0.383 | -0.018 | -0.002 | 0.034 | 0.279 | 1.000 | -0.008 | 0.010 | 0.000 | 1.000 | 0.280 |
| PD_CD | 0.086 | 0.078 | -0.007 | 0.026 | 0.098 | 0.156 | 0.393 | -0.047 | -0.008 | 1.000 | 0.044 | 0.167 | -0.008 | -0.047 |
| PERP_RACE | 0.062 | 0.165 | 0.011 | 0.142 | 0.028 | 0.038 | 0.030 | 0.010 | 0.010 | 0.044 | 1.000 | 0.046 | 0.084 | 0.145 |
| PERP_SEX | 0.023 | 0.022 | 0.011 | 0.013 | 0.016 | 0.074 | 0.051 | 0.000 | 0.000 | 0.167 | 0.046 | 1.000 | 0.022 | 0.019 |
| X_COORD_CD | 0.006 | 0.538 | 0.000 | 0.384 | -0.018 | -0.002 | 0.029 | 0.279 | 1.000 | -0.008 | 0.084 | 0.022 | 1.000 | 0.279 |
| Y_COORD_CD | 0.020 | 0.552 | 0.002 | -0.472 | 0.027 | -0.010 | 0.027 | 1.000 | 0.280 | -0.047 | 0.145 | 0.019 | 0.279 | 1.000 |
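The high-correlation alerts in this report are consistent with flagging pairs whose absolute coefficient is at least 0.5. The sketch below applies that cutoff to a handful of entries copied from the matrix above. The 0.5 threshold is an assumption inferred from which pairs are flagged, and `high_pairs` is a hypothetical helper, not a ydata-profiling function.

```python
# A few entries copied from the correlation table (not the full matrix).
corr = {
    ("ARREST_BORO", "ARREST_PRECINCT"): 0.788,
    ("ARREST_BORO", "X_COORD_CD"): 0.538,
    ("ARREST_BORO", "Y_COORD_CD"): 0.552,
    ("ARREST_PRECINCT", "Latitude"): -0.472,
    ("KY_CD", "LAW_CAT_CD"): 0.723,
    ("Latitude", "Y_COORD_CD"): 1.000,
    ("Longitude", "X_COORD_CD"): 1.000,
    ("Latitude", "Longitude"): 0.279,
}


def high_pairs(corr, threshold=0.5):
    """Return (pair, r) tuples with |r| >= threshold, strongest first."""
    hits = [(pair, r) for pair, r in corr.items() if abs(r) >= threshold]
    return sorted(hits, key=lambda item: -abs(item[1]))


flagged = high_pairs(corr)
for (left, right), r in flagged:
    print(f"{left} ~ {right}: {r:.3f}")
```

Under this cutoff, ARREST_PRECINCT and Latitude (-0.472) fall just short of being flagged, which matches the alert list naming only ARREST_BORO for ARREST_PRECINCT.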
Missing values
Sample
| | ARREST_KEY | ARREST_DATE | PD_CD | PD_DESC | KY_CD | OFNS_DESC | LAW_CODE | LAW_CAT_CD | ARREST_BORO | ARREST_PRECINCT | JURISDICTION_CODE | AGE_GROUP | PERP_SEX | PERP_RACE | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | New Georeferenced Column |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 281369711 | 01/30/2024 | 177.0 | SEXUAL ABUSE | 116.0 | SEX CRIMES | PL 1306501 | F | M | 25 | 0 | 25-44 | M | BLACK | 1000558 | 231080 | 40.800930 | -73.941098 | POINT (-73.9410982410066 40.8009303727402) |
| 1 | 284561406 | 03/30/2024 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211200 | F | B | 44 | 0 | 25-44 | M | BLACK | 1004297 | 242846 | 40.833209 | -73.927554 | POINT (-73.927554 40.833209) |
| 2 | 284896016 | 04/06/2024 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211200 | F | M | 19 | 0 | 25-44 | M | BLACK | 997304 | 222853 | 40.778348 | -73.952863 | POINT (-73.952863 40.778348) |
| 3 | 285569016 | 04/18/2024 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211200 | F | K | 69 | 0 | 25-44 | M | BLACK | 1010576 | 175628 | 40.648698 | -73.905128 | POINT (-73.905128 40.648698) |
| 4 | 287308954 | 05/22/2024 | 464.0 | JOSTLING | 230.0 | JOSTLING | PL 1652501 | M | M | 18 | 0 | 18-24 | M | WHITE | 991530 | 217373 | 40.763313 | -73.973717 | POINT (-73.973717 40.763313) |
| 5 | 286793332 | 05/13/2024 | 155.0 | RAPE 2 | 104.0 | RAPE | PL 1303001 | F | Q | 112 | 0 | 18-24 | M | BLACK HISPANIC | 1025401 | 202586 | 40.722641 | -73.851542 | POINT (-73.8515418216779 40.7226409964758) |
| 6 | 279892607 | 01/03/2024 | 153.0 | RAPE 3 | 104.0 | RAPE | PL 1302503 | F | Q | 113 | 0 | 25-44 | M | BLACK | 1046315 | 187088 | 40.679981 | -73.776234 | POINT (-73.7762339071953 40.6799807384666) |
| 7 | 280263905 | 01/10/2024 | 157.0 | RAPE 1 | 104.0 | RAPE | PL 1303501 | F | B | 42 | 0 | 25-44 | M | BLACK | 1008690 | 238862 | 40.822271 | -73.911698 | POINT (-73.911697780277 40.8222710411331) |
| 8 | 288072319 | 06/06/2024 | 808.0 | TAX LAW | 125.0 | OTHER STATE LAWS | TAX18140B3 | F | M | 13 | 0 | 45-64 | M | BLACK | 987373 | 210805 | 40.745287 | -73.988729 | POINT (-73.98872939424497 40.7452870263689) |
| 9 | 288408753 | 06/12/2024 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211200 | F | B | 52 | 0 | 45-64 | M | BLACK | 1012026 | 253649 | 40.862840 | -73.899580 | POINT (-73.89958 40.86284) |
| | ARREST_KEY | ARREST_DATE | PD_CD | PD_DESC | KY_CD | OFNS_DESC | LAW_CODE | LAW_CAT_CD | ARREST_BORO | ARREST_PRECINCT | JURISDICTION_CODE | AGE_GROUP | PERP_SEX | PERP_RACE | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | New Georeferenced Column |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 260493 | 298408181 | 12/23/2024 | 106.0 | ASSAULT POLICE/PEACE OFFICER | 106.0 | FELONY ASSAULT | PL 1200800 | F | M | 19 | 0 | 45-64 | F | WHITE | 996965 | 221508 | 40.774655 | -73.954093 | POINT (-73.95409255249892 40.77465542447417) |
| 260494 | 298193885 | 12/18/2024 | 101.0 | ASSAULT 3 | 344.0 | ASSAULT 3 & RELATED OFFENSES | PL 1200001 | M | Q | 102 | 0 | 45-64 | M | BLACK | 1024721 | 187770 | 40.681978 | -73.854081 | POINT (-73.854081 40.681978) |
| 260495 | 298472501 | 12/26/2024 | 101.0 | ASSAULT 3 | 344.0 | ASSAULT 3 & RELATED OFFENSES | PL 1200001 | M | Q | 101 | 0 | <18 | M | BLACK | 1053648 | 158969 | 40.602748 | -73.750082 | POINT (-73.750082 40.602748) |
| 260496 | 298690451 | 12/31/2024 | 139.0 | MURDER,UNCLASSIFIED | 101.0 | MURDER & NON-NEGL. MANSLAUGHTE | PL 1252501 | F | B | 43 | 0 | 18-24 | M | BLACK HISPANIC | 1020183 | 239282 | 40.823387 | -73.870170 | POINT (-73.87017 40.823387) |
| 260497 | 298299253 | 12/20/2024 | 439.0 | LARCENY,GRAND FROM OPEN AREAS, UNATTENDED | 109.0 | GRAND LARCENY | PL 1553001 | F | B | 52 | 0 | 25-44 | F | WHITE HISPANIC | 1010992 | 253610 | 40.862735 | -73.903320 | POINT (-73.90332024389409 40.86273501411426) |
| 260498 | 298287970 | 12/20/2024 | 339.0 | LARCENY,PETIT FROM OPEN AREAS, | 341.0 | PETIT LARCENY | PL 1552500 | M | K | 90 | 0 | 25-44 | M | WHITE HISPANIC | 998044 | 198865 | 40.712514 | -73.950245 | POINT (-73.950245 40.712514) |
| 260499 | 298401282 | 12/23/2024 | 439.0 | LARCENY,GRAND FROM OPEN AREAS, UNATTENDED | 109.0 | GRAND LARCENY | PL 1553001 | F | M | 24 | 0 | 45-64 | M | WHITE HISPANIC | 991558 | 226956 | 40.789615 | -73.973609 | POINT (-73.9736085726657 40.78961486176856) |
| 260500 | 298622307 | 12/30/2024 | 922.0 | TRAFFIC,UNCLASSIFIED MISDEMEAN | 348.0 | VEHICLE AND TRAFFIC LAWS | VTL05110MU | M | K | 67 | 0 | 25-44 | M | BLACK | 1003422 | 178505 | 40.656611 | -73.930902 | POINT (-73.93090206546258 40.65661089034527) |
| 260501 | 298335810 | 12/21/2024 | 269.0 | MISCHIEF,CRIMINAL, UNCL 2ND | 121.0 | CRIMINAL MISCHIEF & RELATED OF | PL 1450501 | F | Q | 115 | 0 | 25-44 | M | WHITE HISPANIC | 1020035 | 213111 | 40.751545 | -73.870843 | POINT (-73.87084320922126 40.75154455706598) |
| 260502 | 298548871 | 12/27/2024 | 681.0 | CHILD, ENDANGERING WELFARE | 233.0 | SEX CRIMES | PL 2601001 | M | B | 47 | 0 | 25-44 | M | BLACK | 1026480 | 262584 | 40.887314 | -73.847272 | POINT (-73.8472717577564 40.8873136344706) |